Automating and Validating Semantic Annotations
نویسندگان
چکیده
Use-case diagrams (UCDs) are widely used to describe requirements and desired functionality of software products. However, UCDs are loosely linked to the source code, and there are no approaches to maintain the correspondence between program variable and types (or program entities) and elements of UCDs. We offer a novel approach for automating a part of the process of annotating program entities with names of elements from UCDs. Developers first annotate an initial set of a few program entities. Our LEarning ANnnotations (Lean) system combines these annotations with run-time monitoring, program analysis, and machinelearning approaches to discover and validate annotations on unannotated entities in Java programs. We evaluate our prototype implementation on open-source software projects and our results suggest that Lean can generalize from a small set of annotated entities to annotate many other entities.
منابع مشابه
Automating and Validating Program Annotations
Program annotations help to catch errors, improve program understanding, and specify invariants. Adding annotations, however, is often a manual, laborious, tedious, and error prone process especially when programs are large. We offer a novel approach for automating a part of this process. Developers first specify an initial set of annotations for a few variables and types. Our LEearning ANnnota...
متن کاملValido: A Visual Tool for Validating Sense Annotations
In this paper we present Valido, a tool that supports the difficult task of validating sense choices produced by a set of annotators. The validator can analyse the semantic graphs resulting from each sense choice and decide which sense is more coherent with respect to the structure of the adopted lexicon. We describe the interface and report an evaluation of the tool in the validation of manual...
متن کاملFrom Legacy Documents to XML: A Conversion Framework
We present an integrated framework for the document conversion from legacy formats to XML format. We describe the LegDoC project, aimed at automating the conversion of layout annotations layout-oriented formats like PDF, PS and HTML to semantic-oriented annotations. A toolkit of different components covers complementary techniques the logical document analysis and semantic annotations with the ...
متن کاملThe Role of Spatial Relations in Automating the Semantic Annotation of Geodata
How can the usability of distributed and heterogeneous geographic data sets be enhanced? Semantic interoperability is a prerequisite for effectively finding and accessing relevant data in different application contexts. By using geospatial domain ontologies and semantic annotations of geodata based on these ontologies semantic interoperability can be achieved. However, since no automated method...
متن کاملTraitements automatiques pour la migration de documents numériques vers XML
More and more companies are migrating their legacy document management systems toward XML format, the industrial standard for data exchange. In order to reduce the migration cost we propose an approach aimed at automating the conversion of layout-oriented documents to semantic-oriented annotations. The conversion module uses supervised machine learning techniques to learn a conversion model for...
متن کامل